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Abstract 

Let A n _i denote the (n — l)-dimensional simplex. Let Y be a 
random d-dimensional subcomplex of A n _i obtained by starting with 
the full (d — l)-dimensional skeleton of A n _i and then adding each 
(i-simplex independently with probability p = —. We compute an 
explicit constant 7^ = S(logd) so that for c < 7^ such a random 
simplicial complex either collapses to a (d— l)-dimensional subcomplex 
or it contains SA^+i, the boundary of a {d + l)-dimensional simplex. 
We conjecture this bound to be sharp. 

In addition we show that there exists a constant 7^ < < d + 1 such 
that for any c > Cd and a fixed field F, asymptotically almost surely 
H d (Y;¥)^0. 

1 Introduction 

Let G(n,p) denote the probability space of graphs on the vertex set [n] = 
{1, . . . ,n} with independent edge probabilities p. It is well known (see e.g. 
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[1]) that if c > 1 then a graph G G G(n, £ ) a.a.s. contains a cycle, while for 
c < 1 



,c c 2 



lim Pr [ G 6 G(n, -) : G acyclic } > y/l-c ■ exp(- + — ). (1) 

In this paper we consider the analogous question for (i-dimensional random 
complexes. There are two natural extensions to the notion of an acyclic 
graph. Namely, the vanishing of the d-th homology, and collapsibility to a 
(d — 1) -dimensional subcomplex. These are the two questions we consider 
here. We provide an upper bound on the threshold for the vanishing of the 
d-th homology and a lower bound (which we believe to be tight) for the 
threshold for collapsibility. 

For a simplicial complex K, let denote the i-dimensional skeleton of 
Y . Let Y(i) be the set of i-dimensional simplices of Y and let fi(Y) = \Y(i)\. 
Let A n _i denote the (n — l)-dimensional simplex on the vertex set V = [n]. 
For d > 2 let Y d (n,p) denote the probability space of complexes C 
Y C A^f2i with probability measure 



p r (y) =pf*<y)(i 



p 



-fd(Y) 



Let F be an arbitrary fixed field and let H^Y) = Hi(Y;¥) and H^Y) = 
H l (Y;¥) denote the z-th homology and cohomology groups of Y with coef- 
ficients in F. Let h^Y) = dim F i^(V) and h\Y) = dim ¥ H^Y). Kozlov [3] 
proved the following 

Theorem 1.1 (Kozlov). For any function u(n) that tends to infinity 

ui(n) 

lim Pr [ Y E Y d (n,p) : H d {Y) ^ ] = { „ ^ ~\ 




u(n)n ' 

Our first result improves the upper bound in Theorem 1.1. Let 

g d (x) = (d + l)(x + l)e~ x + x(l - e - x ) d+1 

and let c d denote the unique positive solution of the equation g d {x) — d+ 1. 
A direct calculation yields that c d = d + 1 — 0( Jf). 

Theorem 1.2. For a fixed c> c d 

lim Pr [Y e Y d (n, -) : H d (Y) ^ 0] = 1 . (2) 

n— s-oo 77, 
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Remark: In the 2- dimensional case, Theorem 1.2 implies that if c > c 2 ~ 
2.783 then Y e 7 2 ( s. satisfies H 2 (Y) ^ 0. Simulations indicate that 

the actual threshold is somewhat lower (around 2.75). 

We next turn to collapsibility. A (d — 1)— dimensional simplex r G 
A n -x(d — 1) is a free face of a complex y C if it is contained in a 

unique a G Y(d). Let -R(K) denote the complex obtained by removing all 
free (d — l)-faces of Y together with the c?-simplices that contain them. We 
say that R(Y) is obtained from Y by a d-collapse step. Let Ro(Y) = Y 
and for i > 1 let Ri{Y) = R(Ri-i(Y)). We say that Y is d-collapsible if 
dimi?oo(y) < d. Costa, Farber and Kappeler [2] proved the following 

Theorem 1.3 (Costa, Farber and Kappeler). // u(n) — > oo then Y G 
Y 2 (n, k,(^) n ) is a.a.s. 2 -collapsible. 

Our second result improves Theorem 1.3 and the lower bounds in Theo- 
rem 1.1 as follows. Let 

itd(7,x) = exp(-7(l - x) d ) - x . 

For small positive 7, the only solution of 1^(7, x) — is x — 1. Let 7^ be the 
infimum of the set of all nonnegative 7's for which the equation 1^(7, x) = 
has a solution x < 1. More explicitly, 7^ = (c/x(l — x)^ -1 ) -1 where a; satisfies 
exp(— ^r) = x. It is not hard to verify that this yields 

7 d = logd + (log log d). 

Let J- n> d denote the family of all GYg A„2 1 that do not contain the 

boundary of a (d+ 1)— simplex. 

Theorem 1.4. Let c < 7^ be fixed. Then in the probability space Yd(n, ~) 

lim Pr [Y is d — collapsible \ Y G J- n ,d] = 1 • (3) 

Remark: We have calculated 72 — 2.455, and computer simulations suggest 
that this is indeed the actual threshold for collapsibility for random com- 
plexes in T n p- Also, 73 ~ 3.089,74 ~ 3.508 and 7 100 ~ 7.555. 

Clearly, if Y is <i-collapsible then Y is homotopy equivalent to a {d — 1)- 
dimensional complex, and in particular Hd(Y) = 0. Hence 
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lim Pr \H d {Y) = | Y G F n , d \ = 1 

n— >oo 

and 

c d+2 

lim Pr [H d (Y) = 0] = lim Pr[F n4 ] = exp(- ) . 

The paper is organized as follows. In Section 2 we prove Theorem 1.2. In 
Section 3 we analyze a random d-tree process that underlies our proof that 
for c < 7^, a random F G F d (n, £ ) PI J 7 ^^ IS ct.ct. s. (i-collapsible. Another 
main ingredient of the proof is an upper bound on the number of minimal 
non (i-collapsible complexes given in Section 4. In Section 5 we combine 
these results to derive Theorem 1.4. We conclude in Section 6 with some 
comments and open problems. 

2 The Upper Bound 

Let F e Y d (n,p). Then h^Y) = for < i < d - 1 and /;(F) = (.^) for 
< % < d — 1. The Euler-Poincare relation 

i>0 i>0 

therefore implies 

The inequality h d (Y) > f d (Y) — already implies that if c > d + 1 

then a.a.s. /^(F) 7^ 0. This was also observed in the 2-dimensional case by 
Costa, Farber and Kappeler [2]. The idea of the proof of Theorem 1.2 is to 
improve this estimate by providing a non-trivial lower bound on E[h d -i\- For 
r e A n _i(d - 1) let 

deg Y (r) = \{ ( reY(d):TCa}\ 

and let 

A T = {Y eY d (n,p):deg Y (T) = 0} . 
For a G Y{d) let L a be the subcomplex of a^ d ~^ given by 

L a = a {d ' 2) U {r G a {d ' 1] : deg y (r) > 1} . 
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Let P n> d denote the family of all pairs (a, L), such that a G A n _i(d) and 
a {d-2) cL( - a (d-i)_ For e p n d let 

Pv,l = e ^(n, p) : a G y , L CT = L} . 

The space of z-cocycles of a complex K is as usual denoted by Z l (K). The 
space of relative z-cocycles of a pair K' C A' is denoted by Z l (K,K ; ) and 
will be identified with the subspace of z-cocycles of A that vanish on K' . Let 
z'(A') = dimZ^K) and z\K,K') = dim Z { (K, K'). For a (d - l)-simplex 
r = [vq, . . . , let 1 T be the indicator (d — l)-cochain of r (i.e. l T (r/) = 

sgn(7r) if rj = [i>7r(o), • • • , v n(d-i)] and is zero otherwise). If r G A„_i(d — 1) 
then Z d ~ l {r) is the 1-dimensional space spanned by l r . If (<t, L) G P^ and 
fd-i(L) = j, then z d ~ 1 {a, L) = d — j. indeed, suppose a = [vq, . . . , va] and 
for < % < d let n = [v Q , . . . ,v h . . . ,v d }. If L(d - 1) = {ri} d =d _ j+1 then 
{1 T0 — l Ti }iZi forms a basis of Z d ~ 1 (a, L) . 

Claim 2.1. For any Y G Yd(n,p) 
Z d - 1 {Y)D Z d -\r)® Z d -\a,L) . 

Proof: The containment is clear. To show that the right hand side is a direct 
sum note that nontrivial cocycles in different summands must have disjoint 
supports and are therefore linearly independent. 

□ 

Let 

o(y) = |{reA n _ 1 (d-l) : deg y (r) = 0}| 
and for < j < d let 

a 3 {Y) = \{{a,L) G P M : Y G B a>L , f d ^(L) = 

Note that atj(Y) is the number of d-faces of Y that contain exactly d + 1 — j 
(d — l)-faces of degree 1. By Claim 2.1 

d 

z d ~\Y)>u(Y)^a(Y) + Y,aj(Y)(d-3) ■ (5) 

j=0 
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As h d ^(Y) = h d ~ l {Y) = z d " l (Y) - it follows from (4) and (5) that 

h d (Y)>v(Y)^f d (Y) + u(Y)-(fj . (6) 

Theorem 1.2 will thus follow from 

Theorem 2.2. Let c> c d and let p = -. Then 

lim Pr [ Y G Y d (n,p) : v{Y) < 0] = . 

n— >oo 

Proof: First note that 



E[a] = \dj {1 ~ p) dT n °^ 

and for < j < d 



d+lj\ j 
n d c fd+l 



Therefore 



d 

E[u] = E[a] + Y,E[a J }(d - j) = 

3=0 



3=0 

n d e~ c n d c fd+l 



d\ (d + l)\. 



ii 



d 



((1 + c)(d + l)e~ c - c(l - (1 - e- c ) d+1 )) - OK" 1 ) 



(d+1)! 
It follows that 

E[v)=E[f d ] + E[u] 



(c + (1 + c)(d + l)e" c - c(l - (1 - e" c ) d+1 ) - (d + 1)) - O^- 1 ) 



(d+iy. 

6 
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d 



-(g d (c)-d-l)-0{n d - x ) 



(d+1) 

Since c > q it follows that for sufficiently large n 

E[v] > en d (7) 

where e > depends only on c and d. To show that v is a.a.s. positive we 
use the following consequence of Azuma's inequality due to McDiarmid [5] . 

Theorem 2.3. Suppose f : {0, l} m — > E satisfies \f(x) — f(x')\ < T if x and 
x' differ in at most one coordinate. Let £i, . . . , £ m be independent 0, 1 valued 
random variables and let F = . . . , £ m ). Then for all A > 

Pr[F < - A] < exp(-— ) . (8) 

Let m = ( d ? x ) and let <7i,...,<7 m be an arbitrary ordering of the d- 
simplices of A n _i. Identify F G with its indicator vector . . . , £ m ) 

where &(F) = 1 if ^ G F and &(F) = otherwise. Note that if F and F' 
differ in at most one ci-simplex then \a(Y) — a{Y')\ < d + 1 and |<x,(F) — 
a,(X')\ < d + 1 for all < j < d. It follows that \v(Y) - v(Y')\ < T = 2d 3 . 
Applying McDiarmid's inequality (8) with F = v and A = E[v\ it follows 
that 

Pr[ V < 0] < exp(- ? ^l) < exp^C^" 1 ) 
for some C 2 = C 2 (c, d) > 0. 

□ 



Remark: The approach used in the proof of Theorem 1.2 can be extended as 
follows. For a fixed £, let Z d ^ 1 (Y) C Z d ~ 1 (Y) denote the subspace spanned 
by (d — l)-cocycles (f> G Z d ~ l (Y) such that \supp(<p)\ < I. Let 



9 d Ax) = lim 



n—i-oo 



Ejdim Z d ~\Y)} 





where the expectation is taken in the probability space Y d (n, -). For example, 
it was shown in the proof of Theorem 1.2 that 6^1(2) = e~ x and 

9 dt2 (x) = (1 + x)e-* - -^-(1 - (1 - e^) d+1 ). 

a + 1 
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Let x = ca/ denote the unique positive root of the equation 

x + (d + l)0d,e( x ) = d + l. 
The following fact is implicit in the proof of Theorem 1.2. 
Proposition 2.4. For any fixed c > Cdi 

lim Pr [Y G Y d (n, -) : H d (Y) ^ 0] = 1 . (9) 

n—>oo n 

Let dd = lim^oo c^t- It seems likely that ^ is the exact threshold for the 
vanishing of H d (Y). This is indeed true in the graphical case d = 1. 

Proposition 2.5. 

c 1 = l. 

Proof: For a subtree K = (Vk, Ek) on the vertex set Vk C [n] let denote 
all graphs G G G(n,p) that contain if as an induced subgraph and contain 
no edges in the cut (Vk, Vk)- The space of 0-cocycles Z°(K) is 1-dimensional 
and is spanned by the indicator function of Vk- As in Claim 2.1 it is clear 
that for G G G(n,p) and a fixed I 

Z°(G) D Z°(K). 

{K:\V K \<1 and GeA K } 

Hence for p = - 

E[dim Z^(G)] = '^^{Pr[A K } : K is a tree on < I vertices} = 

k=i v 7 

^ k^ ^ ft ^ k^ ^ 

fc=i ■ fc=i 

Let T(z) = XlfcLi HtF 2 ^ be the exponential generating function for the num- 
ber of trees. Then 

E[dimZ° £) (G)} _ T (xe- X ) 



lim 9n(x) = lim 



n x 



Therefore c\ = lim^oo c\ i is the solution of the equation 



i + 2T( ££! ) =2 (1Q) 

X 



Let R(z) = YlkLi ^~W~ zk ^ e ^ ne exponential generating function for the 
number of rooted trees. It is classically known (see e.g. [6]) that R(z) = 
zexp(R(z)), and that T(z) = R{z) - \R{zf . It follows that i2(e _1 ) = 1 and 
T(e _1 ) = ~. Hence c\ = 1 is the unique solution of (10). 

□ 



3 The Random d-Tree Process 

A simplicial complex T on the vertex set V with \V\ = I > d is a d-tree 
if there exists an ordering V = {v±, . . . , v?} such that lk(T[w 1; . . . , vi\, Vi) is 
a (d — l)-dimensional simplex for all d + 1 < i < I. Let Gt denote the 
graph with vertex set T(d — 1), whose edges are the pairs {r 1; r 2 } such that 
7~i U r 2 G T(d). Let dzst T (ri, r 2 ) denote the distance between n and r 2 in the 
graph Gt- 

A rooted d-tree is a pair (T, r) where T is a <i-tree and r is some (d— l)-face 
of T. Let r be a fixed (d — l)-simplex. Given k > and 7 > we describe 
a random process that gives rise to a probability space Td(k, A) of all rf-trees 
T rooted at r such that distxir, r') < k for all r' G T. The definition of 
Td(k,X) proceeds by induction on k. 7d(0, 7) is the (d — l)-simplex r. Let 
k > 0. A d-tree in 7d(A; + 1,7) is generated as follows: First generate a 
T G 7d(/c, A) and let IA denote all r' G T(d — 1) such that dist^T, r') = k. 
Then, independently for each r' G W, pick J = J T > new vertices ^i, . . . ,zj 
where J is Poisson distributed with parameter 7, and add the d-simplices 
zit', . . . , zjt' to T. 

We next define the operation of pruning of a rooted d-tree (T, r). Let 
{ri, . . . , T{} be the set of all free (d— l)-faces of T that are distinct from r, and 
let a - , be the unique d-simplex of T that contains r». The d-tree T" obtained 
from T by removing the simplices ri, <7i, . . . , T£, an is called the 'pruning ofT. 
Clearly, any T G 7d(/c + l, 7) collapses to its root r after at most k+1 pruning 
steps. Denote by Cd(k + 1,7) the event that T G 7d(k + 1,7) collapses to r 
after at most k pruning steps, and let pd(k, r y) = Pr[Cd(k + 1,7)]. Clearly, 
Pd(0,7) is the probability that T G 7d(l,7) consists only of r, hence 

p d (0, 1 ) = e~\ (11) 
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Let o"i, . . . , Gj denote the <i-simplices of T 6 Tdik + 1, 7) that contain r and 
for each 1 < i < j let rjn, . . . , 77^ be the {d — l)-faces of o~i that are different 
from r. Let Tu G Td(k, A) denote the subtree of T that grows out of r\a- 
Clearly, T collapses to r after at most k pruning steps iff for each 1 < i < j, 
at least one of the ci-trees Tu collapses to its root r\a in at most k — 1 steps. 
We therefore obtain the following recursion: 

00 

P*(k, 7) = E Pr [ J = ^'K 1 " ( x " " 1 ^)) d ) i = 
3=0 

00 j 

^Tj-e^l- (1-^-1,7))^' = 
i=o J ' 

exp(- 7 (l- Pd (A;-l,7)) d ) . (12) 

Equations (11) and (12) imply that the sequence {pd(k, 7)}^ is non- decreasing 
and converges to Pd(j) G (0, 1], where ^(7) is the smallest positive solution 
of the equation 

u d (-y,x) = exp(- 7 (l - a;) d ) - x = 0. (13) 

If 7 > is small, then ^^(7) = 1. Let 7^ denote the infimum of the set of non- 
negative 7's for which Pdi'j) < 1. The pair (7,2;) = (7^, Pd(ld)) satisfies both 
Ud{ld, Pd{ld)) = and 7^(7^ Pd(ld)) = 0. A straightforward computation 
shows that 7^ = (dx(l — x) d_1 ) _1 where x = p d {ld) is the unique solution of 

ex P("^) = x - 

4 The Number of Non-rf-Collapsible Complexes 

When we discuss d-collapsibility, we only care about the inclusion relation 
between ci-faces and (d — l)-faces. Therefore, in the present section we can 
and will simplify matters and consider only the complex that is induced from 
our (random) choice of d- faces. Namely, for every i < d, a given z-dimensional 
face belongs to the complex iff it is contained in some of the chosen <i-faces. 

A complex is a core if every (d — l)-dimensional face belongs to at least 
two simplices, so that not even a single collapse step is possible. 

A core complex is called a minimal core complex if none of its proper 
sub complexes is a core. 
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The main goal of this section is to show that with almost certainty there 
are just two types of minimal core subcomplexes that a sparse random com- 
plex can have. It can either be the boundary of a (d + l)-simplex, dAj+i, 
or it must be very large. Obviously this implies that there are no small non- 
collapsible subcomplexes which do not contain the boundary of a (d + 1)- 
simplex. 

Theorem 4.1. For every c > there exists a constant 5 = 5(c) > such that 
a.a.s. every minimal core subcomplex KofYE Y d (n, -) with fd(K) < 5n d , 
must contain the boundary of a (d + 1) -simplex. 

Henceforth we use the convention that faces refer to arbitrary dimensions, 
but unless otherwise specified, the word simplex is reserved to mean a <i-face. 

Our proof uses the first moment method. In the main step of the proof 
we obtain an upper bound on C d (n,m), the number of all minimal core d- 
dimensional complexes on vertex set [n] = {1,2, ... ,n}, which contain m 
simplices. 

Two simplices are considered adjacent if their intersection is a (d— 1)- face. 

If A U B is a splitting of a minimal core complex, then there is a simplex in 
A and one in B that are adjacent, otherwise the corresponding subcomplexes 
are cores as well. Therefore K can be constructed by successively adding a 
simplex that is adjacent to an already existing simplex. This consideration 
easily yields an upper bound of n d+m on Cd{n,m). The point is that if 
m = 5n d for 5 > small enough, we get an exponentially smaller (in m) 
upper bound and this is crucial for our analysis. 

Lemma 1. Let m = 5n d and 5 > small enough. Then 



pending on whether it is covered by at least bn (d — l)-faces or less. The 
sets of heavy and light (d — 2)-faces are denoted by H d _ 2 and L d _ 2 repec- 
tively. We claim that |-£/d-2| < To see this note that each simplex 

contains exactly d + 1 (d — l)-faces, but the complex is a core, so that each 
(d — l)-face is covered at least twice. Consequently, our complex has at most 
m( - (d— l)-faces. Likewise, each (d— l)-face contains d (d— 2)-faces. Each 




(14) 



Proof. Let b = ( ( - j . A (d — 2)-face is considered heavy or light de- 




ll 



heavy (d — 2)-face is covered at least bn times and the claim follows by the 
following calculation: 



H d . 2 \ < 



d(d + l)m 



2bn 



d(d+l)5n d 
2b~n 



d(d + l)5n 



2b 



d-l 



We extend the heavy/light dichotomy to lower dimensions as well. For 
each < i < d — 3, an i-face is considered heavy if it covered by at least b ■ n 
heavy (i + l)-faces. Otherwise it is light. The sets of heavy/light 2-faces are 
denoted by Hi resp. Lj. By counting inclusion relations between heavy faces 



The set of z-dimensional heavy (resp. light) faces contained in a given 
face a is denoted by iff be (resp. Lf). 

The bulk of the proof considers a sequence of complexes Ci, . . . , C m = C, 
where the complex Cj is obtained from Cj_i by adding a single simplex. A 
{d— l)-face a of Cj can be saturated or unsaturated. This depends on whether 
or not every simplex in C m that contains a already belongs to Cj. Prior to 
defining the complexes Cj, we specify the set of heavy (d — 2)-faces in one of 

at most L^ff 1 )^) possible ways. Note that this choice uniquely determines 
the sets of heavy faces for every dimension < i < d — 3. We start off 
with the complex C±, which has exactly one simplex. Clearly there are 
possible choices for C\. We move from Cj_i to Cj by adding a single simplex 
ti, which covers a chosen unsaturated [d — l)-face <7j_i of Cj_i. Our choices 
are subject to the condition that every heavy (d — 2)-face in C m is one of 
the heavy [d — 2)-faces chosen prior to the process. In other words, we must 
never make choices that create any additional heavy faces in addition to 
those derived from our preliminary choice. Our goal is to bound the number 
of choices for this process. 

The crux of the argument is a rule for selecting the chosen face. Associ- 
ated with every face is a vector counting the number of its heavy vertices, 
its heavy edges, its heavy 2-faces etc. The chosen face is always lexicograph- 
ically minimal w.r.t. this vector, breaking ties arbitrarily. A [d — l)-face all 
of whose subfaces are light is called primary. 

In each step j we expand a (d — l)-face a to a simplex o~Uy. Such a step 
is called a saving step if either: 
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1. The vertex y is heavy. 

2. There exists a light (d — 2)- subface r C a such that r U y is contained 
in a simplex in Cj_i. 

3. There exists a light subface t G a such that the face r U y is heavy. 

Note that the number of choices of y in the first case is < \H \ < 
(d — 1)! • b ■ n. In the second case the number of choices for y is at most 
dbn. In the third case there are d — 2 possibilities for the dimension of the 
light face and for each such dimension i there are at most bn choices for 
y. In all cases the number of choices for y is at most < d d bn. A step that is 
not saving is considered wasteful. For wasteful steps we bound the number 
of choices for y by n. 

The idea of the proof is that every such a process which produces a 
minimal core complex must include many saving steps. More specifically, we 
want to show: 

Claim 4.2. For every d 3 wasteful steps, at least one saving step is carried 
out. 

Proof. The proof of this claim consists of two steps. We show that there 
is no sequence of d(d — 1) consecutive wasteful steps, without the creation 
of an unsaturated primary face. Also, the creation of d + 1 primary faces 
necessarily involves a saving step. 

If u is a vertex in a (d — l)-face a, let rf(u) be the number of heavy 
i-faces in a that contain u. Also, V" denotes the set of vertices v in a that 
are included only in light z-subfaces of a. 

Proposition 4.3. Let a and a' be two consecutively chosen faces where a is 
non-primary and the extension step on a is wasteful. Then a' precedes a in 
the order of faces and \ V° \ > {Vfl + 1 where i is the smallest dimension for 
which | iff | > 0. 

Proof. Since the extension step on o is wasteful (and, in particular, not a 
saving step of type (iii)) and since all j-subfaces of a are light for j < i, every 
j-face in a U y is light. Moreover, every i-subface of a U y that contains y is 
light as well. 

We claim that a' = a\{u}U{y} where the vertex u of a maximizes rf(v). 
(Since \H?\ > 0, there are vertices v in o for which rf(v) > 0). 
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Notice that a has ( d ^) —r?{u) more light i-subfaces than does r" := a\u. 

Namely, \Lf\ = \L?\ -f^ 1 ) + r ?( u )- 

Combining the fact that every z-subface of o U y that contains y is light 
we see that in := t u U y, \Lf\ = \Lf\ + (^T 1 ) = |Lf| + rf(w). But since 
rf (it) > 0, Ty precedes a. In this case Ty must be a new face that does not 
belong to the previous complex, or else it would have been preferred over 
a. Being a new face, it is necessarily unsaturated. Since u maximizes r^(u) 
over all vertices in a, it follows that Ty precedes all other faces created in the 
expansion. Furthermore, no other face precedes a or else it would be chosen 
rather than a. Thus a' = t%, as claimed. Notice that y e V° and also 
V? C Vf (Note that every z-dimensional subfaces of a' that is not contained 
in a is light since it contains y). Thus \V° \ > \Vf\ + 1. □ 

Consider a chosen non-primary face a and let % be the smallest dimen- 
sion for which \H°\ > 0. The previous claim implies that after at most d 
consecutive wasteful steps the chosen face, 9 precedes a and \ V®\ = d. Then 
\Hj\ = for all j < % (in particular \Hf\ = 0). By repeating this argument 
d — 1 times we conclude that following every series of d(d — 1) consecutive 
wasteful steps, a primary face must be chosen: After at most d consecutive 
wasteful steps the chosen face can have no heavy vertices. At the end of the 
next d consecutive wasteful steps, the chosen face has no heavy vertices nor 
heavy edges. Repeating this argument (d — 1) times necessarily leads us to 
a chosen primary face. 

Proposition 4.4. Only saving steps can decrease the number of unsaturated 
primary faces. 

Proof. Let a = a± , . . . , be a primary face and let y be the vertex that 
expands it. Denote the (d — l)-face {ai, a 2 • . • , a i+ i, . . . , a^} U {y} by a 1 . 
Since this is not a saving step of type (i), y is light. It is also not of type 
(hi) and so \H% | = for every i = 1, . . . , d and k = 1, . . . , d — 2, so that 
faces a 1 are primary. However this is not a type (ii) saving step, so all the 
(d— l)-faces a % must be new. Thus the number of unsaturated primary faces 
has increased by at least d — 1. □ 

The proof of Claim 4.2 is now complete, since at each step at most d + 1 
faces get covered. □ 

We can turn now to bound the number of minimal core m-simplex com- 
plexes C m . As mentioned, we first specify the heavy (d — 2)-faces of C m by 
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specifying a set of out of the total of L^J (d — 2)-faces. Then we 

select the first simplex C\ and mark all its (d — l)-faces as unsaturated. In 
order to choose the i-th step we first decide whether it is a saving or wasteful 
step, and if it is a saving step, what type it has. There is a total of d + 1 
possible kinds of extensions of the current (d— l)-face: A saving step of type 
(i), (ii), or one of the d — 2 choices of type (iii) (according to dimension), 
or a wasteful step. In a saving step the expanding vertex can be chosen in 
at most d d bn ways. The number of possible extension clearly never exceeds 
n and it is this trivial upper bound that we use for wasteful steps. Finally 
we update the labels on the (d — l)-faces of a new simplex. We need to 
decide which of the unsaturated (d — l)-faces that are already covered by at 
least two simplices change their status to saturated. There are at most 2 d+1 
possibilities of such an update. As we saw, at least ^ of the steps in such 
process are saving steps. Consequently we get the following upper bound on 
Cd(n, m), the number of minimal core n- vertex d- dimensional complexes with 
m simplices. (In reading the expression below, note that the terms therein 
correspond in a one-to-one manner to the ingredients that were just listed). 



L}_i d _^Jn d+1 (d + l) m - l n m -\d d b)^{2 d+1 ) m 

< ( ,2 !^]n d n m (2 d+1 d 2 (d 2 S)^) m < '^]n d n m (2 d+1 d 3 S^y 
\{d 2 m) d J \{d z m) d J 



□ 



Proof of Theorem 4.1: We show the assertion with 5 = 5(c) = 
[2 d+2 d?c)~ di . Indeed, consider a complex drawn from Yd(n,p). Let X m = 
X m (n,p) count the number of minimal core subcomplexes with m simplices 
and which are not copies of dAd+i- Our argument splits according to whether 
m is small or large, the dividing line being m = mi = (d 3 logn) d . The theo- 
rem speaks only about the range m < m 2 = 5(c)n d . By Lemma 1, 
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m.2 m 2 m 2 



{d 2 m)' 



EX m < Cd(n,m)p m < ^ (n^ 1 )^™^ n d (2 d+l d 3 5^ a)' 

m=mi m=mi m=m\ 

r»2 d— 1 rn 2 / l 

m=mi m=mi ' 



m.2 / -| \ (d 2 m)~ 

= 0(1) 



m=mi 



It follows that with almost certainty no cores with m simplices occur 
where m 2 > m > We next consider the range d + 3 < m < m\. Note 
that a minimal core complex with m > d + 3 simplices has at most 4£§ 



vertices. Let A (it) denote the number of simplices that contain the vertex it. 
Clearly, if A (it) > then A(u) > d + 1 (Consider a simplex a that contains 
it. Every face of the form a \w with u ^ w G a is covered by a simplex 
other than a). It is not hard to verify that if some simplex a contains two 
distinct vertices with A (it) = A(w) = d+1 then the complex contains dAj+i 
contrary to the minimality assumption. Let t be the number of vertices it 
with A (it) — d+1. No simplex contains two such vertices, so that t < 
Counting vertices in the complex according to the value of A we get 

{d + l)t + (d + 2){v - t) < (d + l)m 

where v is the total number of vertices. The conclusion follows. 

The expected number of minimal core subcomplexes of Yd(n,p) that con- 
tain d + 3 < m < mi simplices satisfies 



mi mi / \ . rt , , mi 

E £ E UJ [(sn-j p] £ E 

m=d+3 m=d+3 y d+4 ' m=d+3 

™\ , lnp .2d(d+l)(rf+4) m 

< g ^+4^§ ^y +i = o( i). 



n d + 4 



m d+1 p 



m=d+3 



Consequently, a.a.s. Yd(n,p) contains no minimal core subcomplexes of m 
simplices with d + 3 < m < (2 d+2 d?c)~ di n d . 



□ 
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5 The Threshold for rf-Collapsibility 



For a complex Y C A n \ and a fixed r G A n _i(d — 1), define a sequence of 
complexes {<Sj(y)}i>o as follows. S (Y) = r and for % > 1 let <Sj(Y) be the 
union of iSj_i(F) and the complex generated by all the <i-simplices of Y that 
contain some rj G Si-i(Y)(d — 1). Let Td denote the family of all d-trees. 
Consider the events A^, D C y<2(n,p) given by 

A k = {S k {Y) G %} 

and 

-D = {deg y (?7) < logn for all rj G A n ^ 1 (d — 1)}. 

Claim 5.1. Let k and c > be fixed and p — -. Then 

PrL4 fe+1 n£>] = l-o(l). 

Proof: Fix 77 G A n _i(d — 1). The random variable deg y (?7) has a binomial 
distribution B(n — d, -), hence by the large deviations estimate 

Pr[deg y (r/) > logn] < n ~ n{l ° sl ° sn) . 

Therefore Pr[D] = 1 - o(l). UYeD then f (S k+1 (Y)) = 0(\og k+1 n) and 
/d-i(«5fc(V)) = 0(log fc n). Note that <Sfc + i(y) is a d-tree iff in its generation 
process, we never add a simplex of the form rjv such that both r\ G A n _i (d— 1) 
and v G A n _i(0) already exist in the complex. Since the number of such pairs 
is at most f (S k+ i(Y)) fo^S^Y)) it follows that 

PrL4 fc+1 n D] > (l - £)^(5* + i(y))/d-i(s*(y)) > 

n 

□ 

For y C A^_ x let r(Y) = fd(R 0O (Y)) be the number of d-simplices 
remaining in Y after performing all possible (^-collapsing steps. For r G 
A n _i(d - 1) let r(r) = {cr G A n _i(d) : cr D r}. 

Claim 5.2. Lei < c < 7^ 6e fixed and p — -. Then for any fixed r G 
A n _i(d): 

Pr[ J R 00 (y)nr(r) ^0] =o(l) . (15) 
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Proof: Let 5 > 0. Since c < 7^ 



lim p d (k,c) = p d (c) = 1. 

k— >oo 



Choose a fixed k such that 



p d (k,c) > 1 - -. 



Claim 5.1 implies that if n is sufficiently large then 

Pr[A k+1 n D] > 1 - -. 

Next note that if y G then <Sfc+i = iSfc+i(y) can be generated by the 

following inductively defined random process: Sq = r. Let < i < k. First 
generate T = Si and let U denote all r' G T(d — 1) such that distxiT, r') = i. 
Then, according to (say) the lexicographic order on U, for each r' G U pick 
J new vertices z±,...,zj according to the binomial distribution B(n — n', -), 
where n' is the number of vertices that appeared up to that point, and add 
the (i-simplices z\r' ', . . . , zjt' to T. Note that the process described above is 
identical to the d-tree process of Section 3, except for the use of the binomial 
distribution B(n — n', ^) instead of the Poisson distribution Po(c). Now if 
Y G Ak+i fl D then n' = 0(log fc+1 n) at all stages of this process. It follows 
that if n is sufficiently large then the total variation distance between the 
distributions Sk+i(Y) and Td(k+1, c) is less then |. Denote by C' d (k + 1, c) the 
event that Sk+i(Y) is in A^+i and collapses to r in at most k pruning steps. 
The crucial observation now is that if Y G C' d (k + 1, c) then i? 00 (F)nr(r) = 0. 
It follows that 

Pr[iU^)nr(r)^0]< 

(1 - PrL4 fc+1 n D}) + Pr[F G* C' d (k + 1, c)] < 
(1 - Pr[A k+1 n D}) + d TV (S k+ i(Y), T d (k + 1, c)) + (1 - Pi[C d (k + 1, c)]) < 

— I h - = 5. 

3 3 3 

□ 

Let 

Q{Y) = {re K-i{d - 1) : Roo(Y) H T(r) ^ 0} 

and let = For a family Q C A n _i(rf — 1) let w{Q) denote the 

set of all d-simplices a G A n _!(<i) all of whose (d — l)-faces are contained in 
Q. Using Claim 5.2 we establish the following 
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Theorem 5.3. Let 5 > and < c < 7^ be fixed and let p = -. Then 

Pv[f d (R 00 (Y))>8n d ]=o(l). 

Proof: Let < e = e(d,c,S) < 1 be a constant whose value will be fixed 
later. Clearly 

Pr[/ d ( J R 00 (y)) > 5n d ] < 

Pv[g(Y) > e5n d ] + Pv[g(Y) < e5n d & f d (Roo(Y)) > 5n d }. 

To bound the first summand note that E[g] = o(n d ) by Claim 5.2. Hence by 
Markov's inequality 

Pv[g(Y) > e5n d ] < (e5n d )- 1 E[g] = o(l). 

Next note that 

Pr[s(Y) < e5n d & f d (Roo(Y)) > 5n d ] < 
Pt[\w(G) nY(d)\ > 5n d }. 

{gcA n - 1 (d-iy.\g\=eSn d } 

Fix a Q C A n _i(d — 1) such that |£?| = e5n d . By the Kruskal-Katona theorem 
there exists a Ci = Ci(d, 5) such that 

iV= \w(G)\ < Cie^n d+1 . 

Applying the large deviation estimate for the binomial distribution B(N, -) 
and writing C2 = ^p- we obtain 

Pr[\w(G) n > < (C 2 e^) 5nd . 

On the other hand 





e5n d J v e5 



< i c \t8n d 



Choosing e such that 

, e . d+i _-, 
(^) C 2 e <* < e 
eo 

it follows that 

Pv[g(Y) < e5n d & fd(Roo(Y)) > 6n d ] < exp(-5n° 
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□ 



Proof of Theorem 1.4: Let c < 7^ and p — -. By Theorem 4.1 there exists 
a 5 > such that a.a.s. any non-<i-collapsible subcomplex K of Y G l^(n, -) 
such that fd{K) < 5n d contains the boundary of a (d+ l)-simplex. It follows 
that 

Pr [Y non d - collapsible | Y G F n , d ] = Pr [f d (R oc (Y)) > | Y G T n4 ] < 

Pr [f d (Roo(Y)) > 5n d ]-Pr [ Y G J^+Pr [0 < f d (Roo(Y)) <5n d \Ye F n4 \. 

The first summand is o(l) by Theorem 5.3, and the second summand is o(l) 
by Theorem 4.1. 

□ 

6 Concluding Remarks 

Let us remark that, estimating the vanishing probabilities by a function 
which tends to as n — > 00 faster that n~ d one may show a random process 
statement slightly stronger than Theorem 4.1 (see [4], where a similar result is 
shown for the fc-core of random graphs) . More specifically, let us define the d- 

dimesional random process y d = {Y d (n, M)}^q as the Markov chain whose 
stages are simplicial complexes, which starts with the full (d — l)-dimensional 
skeleton of A n _i and no d-simplices, and in each stage Yd(n, M+l) is obtained 
from Y d (n, M) by adding to it one <i-simplex chosen uniformly at random from 
all the (i-simplices which do not belong to Y d (n,M). Then, the following 
holds. 

Theorem 6.1. There exists a constant a = a(d) > such that for almost 

every d- dimesional random process y d = {Y d (n, M)}\^q there exists a stage 
M = M{y d ) such that the core of Y d (n, M) is of the size 0(1) and consists 
of boundaries of (d+l)-simplices, while the core of M = M{y d + 1) contains 
at least an d d-simplices. 

Many questions remain open. The most obvious ones are 

• What is the threshold for <i-collapsibility of random simplicial com- 
plexes in T ntd ? We conjecture that it is indeed p = r y d /n. 
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• Derive better upper bounds on the threshold for the nonvanishing of 
H d {Y). We intend to return to this subject in subsequent papers. 

• In particular does the threshold for the vanishing of Hd(Y) depend on 
the underlying field? 

• Although this question is implicitly included in the above two ques- 
tions, it is of substantial interest in its own right: Can you show that 
the two thresholds (for <i-collapsibility and for the vanishing of the 
top homology) are distinct? We have good reasons to think that the 
two thresholds are, in fact, quite different. In particular, although d- 
collapsibility is a sufficient condition for the vanishing of Hd, there is 
only a vanishingly small probability that a random simplicial complex 
with trivial top homology is ci-collapsible. 
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